Algorithmic scheme for hybrid computing with CPU, Xeon-Phi/MIC and GPU devices on a single machine
نویسندگان
چکیده
In this paper, we address the problem of the efficient parallel exploitation of different types of computing devices inside a single machine, to solve a scientific problem. As a first step, we apply our scheme to the Jacobi relaxation. Despite its simplicity, it is a good example of iterative process for scientific simulation. Then, we evaluate and analyze the performance of our parallel implementation on two configurations of hybrid machine.
منابع مشابه
Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores
For computational fluid dynamics (CFD) applications with a large number of grid points/cells, parallel computing is a common efficient strategy to reduce the computational time. How to achieve the best performance in the modern supercomputer system, especially with heterogeneous computing resources such as hybrid CPU+GPU, or a CPU + Intel Xeon Phi (MIC) co-processors, is still a great challenge...
متن کاملGaraph: Efficient GPU-accelerated Graph Processing on a Single Machine with Balanced Replication
Recent advances in storage (e.g., DDR4, SSD, NVM) and accelerators (e.g., GPU, Xeon-Phi, FPGA) provide the opportunity to efficiently process large-scale graphs on a single machine. In this paper, we present Garaph, a GPU-accelerated graph processing system on a single machine with secondary storage as memory extension. Garaph is novel in three ways. First, Garaph proposes a vertex replication ...
متن کاملStatus of Archer — a Monte Carlo Code for the High-performance Heterogeneous Platforms Involving Gpu and Mic
Accelerators such as Graphics Processing Units (GPUs) and Many Integrated Core (MIC) coprocessors are advanced computing devices with outstandingly high computing performance and energy efficiency. The Monte Carlo transport simulation community views these advanced devices as an opportunity to effectively reduce the computation time for performance-critical applications. In this paper, we repor...
متن کاملConcurrent Cpu, Gpu and Mic Execution Algorithms for Archer Monte Carlo Code Involving Photon and Neutron Radiation Transport Problems
ARCHER-CT and ARCHER-Neutron are Monte Carlo photon and neutron transport applications that have now been updated to utilize CPU, GPU and MIC computing devices concurrently. ARCHER detects and simultaneously utilizes all CPU, GPU and MIC processing devices that are available. A different device layout and load-balancing algorithm is implemented for each Monte Carlo transport application. ARCHER...
متن کاملPerformance Analysis and Efficient Execution on Systems with multi-core CPUs, GPUs and MICs
We carry out a comparative performance study of multi-core CPUs, GPUs and Intel Xeon Phi (Many Integrated Core MIC) with a microscopy image analysis application. We experimentally evaluate the performance of computing devices on core operations of the application. We correlate the observed performance with the characteristics of computing devices and data access patterns, computation complexiti...
متن کامل